A Horizontal Alignment Tool for Numerical Trend Discovery in Sequence Data: Application to Protein Hydropathy
نویسندگان
چکیده
An algorithm is presented that returns the optimal pairwise gapped alignment of two sets of signed numerical sequence values. One distinguishing feature of this algorithm is a flexible comparison engine (based on both relative shape and absolute similarity measures) that does not rely on explicit gap penalties. Additionally, an empirical probability model is developed to estimate the significance of the returned alignment with respect to randomized data. The algorithm's utility for biological hypothesis formulation is demonstrated with test cases including database search and pairwise alignment of protein hydropathy. However, the algorithm and probability model could possibly be extended to accommodate other diverse types of protein or nucleic acid data, including positional thermodynamic stability and mRNA translation efficiency. The algorithm requires only numerical values as input and will readily compare data other than protein hydropathy. The tool is therefore expected to complement, rather than replace, existing sequence and structure based tools and may inform medical discovery, as exemplified by proposed similarity between a chlamydial ORFan protein and bacterial colicin pore-forming domain. The source code, documentation, and a basic web-server application are available.
منابع مشابه
An Application of the ABS LX Algorithm to Multiple Sequence Alignment
We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...
متن کاملDiscovery of Novel Peptidomimetics for Brain-Derived Neurotrophic Factor using Phage Display Technology
Brain-Derived Neurotrophic Factor (BDNF) is a neuroprotectant candidate for neurodegenerative diseases. However, there are several clinical concerns about its therapeutic applications. In the current study, we selected BDNF-mimicking small peptides from phage-displayed peptide library as alternative molecules to the clinical challenges. The peptide library was screened against BDNF receptor (Ne...
متن کاملENDscript: a workflow to display sequence and structure information
UNLABELLED ENDscript is a web server grouping popular programs such as BLAST, Multalin and DSSP. It uses as query the co-ordinates file of a protein in Protein Data Bank format and generates PostScript and png figures showing: residues conserved after a multiple alignment against homologous sequences, secondary structure elements, accessibility, hydropathy and intermolecular contacts. Thus, the...
متن کاملMPEx: a tool for exploring membrane proteins.
Hydropathy plot methods form a cornerstone of membrane protein research, especially in the early stages of biochemical and structural characterization. Membrane Protein Explorer (MPEx), described in this article, is a refined and versatile hydropathy-plot software tool for analyzing membrane protein sequences. MPEx is highly interactive and facilitates the characterization and identification of...
متن کاملMelody discrimination and protein fold classification
One of the greatest challenges in theoretical biophysics and bioinformatics is the identification of protein folds from sequence data. This can be regarded as a pattern recognition problem. In this paper we report the use of a melody generation software where the inputs are derived from calculations of evolutionary information, secondary structure, flexibility, hydropathy and solvent accessibil...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 9 شماره
صفحات -
تاریخ انتشار 2013